Improving the Efficiency of Clustering by Using an Enhanced Clustering Methodology
Abstract
In data analysis, clustering groups data with similar features into the same cluster. Each cluster consists of data that are more similar to one another than to data in other clusters. From a machine learning perspective, clustering can be viewed as a form of unsupervised learning. In this paper, we propose an Enhanced Clustering Methodology to obtain better clustering quality with much lower complexity. We evaluate the performance of the classical K-Means approach to data clustering, its modified variant Global K-Means, an Efficient K-Means, and the proposed Enhanced K-Means method. The accuracy of all these algorithms was examined on several data sets from the UCI [21] repository of machine learning databases. Their clustering efficiency was compared using two typical cluster validity indices, namely the Davies-Bouldin Index and Dunn's Index, for different numbers of clusters. Our experimental results demonstrate that the clustering quality of the proposed method is considerably better than that of the other K-Means based algorithms when larger data sets with more attributes are considered. In addition, the computational time required for clustering by the proposed algorithm is much lower than that of the other methods discussed.
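To make the evaluation setup in the abstract concrete, the following is a minimal sketch, assuming NumPy, of the classical K-Means baseline together with the two validity indices named above. The function names `kmeans`, `davies_bouldin`, and `dunn_index` are illustrative and do not come from the paper. The sketch does not implement the proposed Enhanced K-Means, Global K-Means, or Efficient K-Means, since the abstract gives no details of them. The Dunn variant shown uses the minimum inter-cluster point distance over the maximum cluster diameter, which is one common definition among several.

```python
import numpy as np

def kmeans(X, k, n_iter=100, seed=0):
    """Classical (Lloyd-style) K-Means; returns (labels, centroids)."""
    rng = np.random.default_rng(seed)
    # Initialise centroids from k distinct data points (an assumption of this sketch).
    centroids = X[rng.choice(len(X), size=k, replace=False)]
    for _ in range(n_iter):
        # Assign each point to its nearest centroid.
        dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
        labels = dists.argmin(axis=1)
        # Recompute centroids; keep the old one if a cluster became empty.
        new_centroids = np.array([
            X[labels == j].mean(axis=0) if np.any(labels == j) else centroids[j]
            for j in range(k)
        ])
        if np.allclose(new_centroids, centroids):
            break
        centroids = new_centroids
    # Final assignment so labels match the returned centroids.
    dists = np.linalg.norm(X[:, None, :] - centroids[None, :, :], axis=2)
    return dists.argmin(axis=1), centroids

def davies_bouldin(X, labels):
    """Davies-Bouldin Index: lower values indicate better clustering."""
    ids = np.unique(labels)
    cents = np.array([X[labels == c].mean(axis=0) for c in ids])
    # Average distance of each cluster's points to its own centroid.
    scatter = np.array([
        np.linalg.norm(X[labels == c] - cents[i], axis=1).mean()
        for i, c in enumerate(ids)
    ])
    sep = np.linalg.norm(cents[:, None, :] - cents[None, :, :], axis=2)
    np.fill_diagonal(sep, np.inf)  # makes the i == j ratio zero
    ratios = (scatter[:, None] + scatter[None, :]) / sep
    return ratios.max(axis=1).mean()

def dunn_index(X, labels):
    """Dunn's Index: higher values indicate better clustering."""
    pair = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=2)
    same = labels[:, None] == labels[None, :]
    max_diameter = pair[same].max()     # largest within-cluster distance
    min_separation = pair[~same].min()  # smallest between-cluster distance
    return min_separation / max_diameter

if __name__ == "__main__":
    # Toy data: two well-separated groups on a line.
    X = np.array([[0.0], [1.0], [10.0], [11.0]])
    labels, _ = kmeans(X, k=2)
    print(davies_bouldin(X, labels), dunn_index(X, labels))
```

Comparing algorithms as the abstract describes would mean running each one for several values of k and tabulating both indices alongside wall-clock time.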
Similar resources
Improving Imbalanced data classification accuracy by using Fuzzy Similarity Measure and subtractive clustering
Classification is one of the important parts of data mining and knowledge discovery. In most cases, the data used for training is not well distributed. This imbalance occurs when one class has a large number of samples while the number of samples in the other classes is inherently low. In general, the methods of solving this kind of prob...
Improving Lifetime of Strategic Information Network in Oil Supply Chain
Today, information networks play an important role in supply chain management. Therefore, in this article, clustering-based routing protocols, which are one of the most important ways to reduce energy consumption in wireless sensor networks, are used to optimize the supply chain informational cloud network. Accordingly, first, a clustering protocol is presented using self-organizing map neu...
Improving Vehicular Ad-Hoc Network Stability Using Meta-Heuristic Algorithms
Vehicular ad-hoc networks (VANETs) are an important component of intelligent transportation systems, in which vehicles are equipped with on-board computing and communication devices that enable vehicle-to-vehicle communication. Consequently, as the growing number of vehicles increases the volume of communication, the stability of connectivity becomes a challenging problem. Clustering technique as ...
Improving Accuracy in Intrusion Detection Systems Using Classifier Ensemble and Clustering
Recently, with the development of technology, the number of network-based services is increasing, and sensitive information of users is shared through the Internet. Accordingly, large-scale malicious attacks on computer networks could cause severe disruption to network services, so cybersecurity has become a major concern for networks. An intrusion detection system (IDS) could be cons...